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Structural studies of proteins normally require large quantities of pure material that can only be obtained 
through heterologous expression systems and recombinant technique. In these procedures, large amounts 
of expressed protein are often found in the insoluble fraction, making protein purification from the 
soluble fraction inefficient, laborious, and costly. Usually, protein refolding is avoided due to a lack of 
experimental assays that can validate correct folding and that can compare the conformational population to 
that of the soluble fraction. Herein, we propose a validation method using simple and rapid ID 'H nuclear 
magnetic resonance (NMR) spectra that can efficiently compare protein samples, including individual 
information of the environment of each proton in the structure. 

Heterologous expression of proteins in Escherichia coli is the most widespread technique used to produce 
large amounts of protein for structural studies. However, a sufficient quantity of protein is not always 
obtained using common, native, non-denaturant protocols. Major problems associated with rapid and 
high-level expression in bacteria include protein aggregation and formation of inclusion bodies 1 . In some 
structural genomics projects 2,3 , approximately 40% of the expressed targets are found in the insoluble fraction. 
Although these proteins are usually discarded as viable targets, the use of a simple refolding protocol could resolve 
the problem. Often, the fraction of soluble protein is not sufficient to make the production of large amounts of 
protein viable, and, in these cases, use of a refolding protocol can be the best solution to increase protein yield. 
Several techniques have been applied to refold proteins 46 , but the problem of how to solubilize and correctly fold 
proteins remains a major issue. One of the main difficulties in using refolded proteins is that there are very few 
experiments that discriminate between different conformational states and guarantee the correct refolding of the 
protein. Enzymatic 7-10 and biological activity 1113 have been widely used to confirm correct protein refolding. 
However, in general, significant statistical error is associated with these assays, and the structural integrity of the 
refolded sample cannot be ensured. With respect to direct analysis of protein structure, circular dichroism (CD) is 
an effective technique only when large changes in secondary structure are present. 

High resolution structural studies can be hindered by low yields of expressed protein or lack of an optimized 
production/purification protocol. For example, obtaining a 3D nuclear magnetic resonance (NMR) spectrum of a 
20 kDa protein typically requires 5-10 mg of very pure and high cost isotopically-labeled protein. In x-ray 
crystallography, large amounts of very concentrated protein are required to test crystallization conditions. In 
addition, protein purity alone is not always sufficient to ensure successful crystal growth. Even small changes in a 
protein's three-dimensional structure can disturb crystal formation 1416 . If an alternative conformation or oli- 
gomer attaches to the crystal, it may form a crystal surface that prevents attachment of other protein molecules, 
poisoning its growth 17,18 . Proteins can exist as multiple conformations in equilibrium due to many factors, such as 
structural flexibility, enzymatic activity, cooperativity, and oligomerization. For some experiments, conforma- 
tional purity may not be important. For example, a biological activity assay of a protein with 20% in an inactive 
conformation is equivalent to another experiment with 20% less protein, but with 100% in an active conforma- 
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tion. However, in crystallography, NMR, and many other types of 
studies, this information may be relevant and necessary. A simple 
technique that can compare and ensure that different protein sam- 
ples, obtained from different purification protocols, have the same 
conformational distribution is helpful in such cases. 

In this paper, we suggest the use of a simple and fast ID 'H NMR 
spectrum to validate refolding protocols before increasing protein 
production. This technique is able to detect very small conforma- 
tional changes in protein structure, as 'H NMR is sensitive to changes 
in the environment of each hydrogen atom of a molecule (see e.g. 19 ). 
It represents a powerful method for the verification of the spatial 
structure of (macro)molecules 20 and gives us the expected informa- 
tion, and these properties are unique and essential for subsequent 
structural studies of the analyzed sample. 

We applied the proposed method to P21 from Trypanosoma cruzi, 
an 18 KDa protein that contains six cysteine residues that are all 
expected to form disulfide bonds. P2 1 is a ubiquitous secreted protein 
of T. cruzi, the causative parasite of Chagas disease. P21 was found to 
serve as part of host cell invasion mechanism 21 , triggering actin poly- 
merization on the host cell via interactions with the CXCR4 chemo- 
kine receptor 22 , which is present on the cellular membrane. Previous 
studies 21 showed that recombinant P21 (P21-His 6 , with an N- 
terminal histidine tag) adheres to HeLa cells in a dose-dependent 
manner. Additionally, invasion of HeLa cells by T. cruzi metacyclic 
trypomastigotes (tissue cultured trypomastigotes, TCT) and extra- 
cellular amastigotes (EA) is greatly increased in the presence of 
P21-His 6 . Furthermore, P21-His 6 upregulates the phagocytosis of 
zymosan particles in mice peritoneal macrophages 22 , indicating that 
P21 is involved in the phagocytic mechanism. More than 95% of 
expressed P21-His 6 is insoluble in Escherichia coli, making the yield 
of the soluble protein insufficient for biological and structural stud- 
ies. To overcome this limitation, we extracted P21-His 6 from inclu- 
sion bodies using urea and refolded it. Since recombinant P21 is 
fused to a histidine tag, we used nickel affinity chromatography to 
refold the protein by applying a slow urea gradient. This technique 
has been used successfully to refold proteins in many cases 23-27 
because protein absorption on the solid matrix prevents aggregation 
during the process through separation of individual protein mole- 
cules 5 . To assess the available refolding validation techniques, we 
compared parasite invasion and phagocytosis of particles induced 
by refolded (rP21-His 6 ) and soluble (sP21-His 6 ) protein. As a second 
approach, refolded and soluble protein structures were compared by 
circular dichroism (CD) to find information regarding the state of 
the proteins' secondary structures. Finally, ID 'H NMR spectra of the 
soluble and refolded proteins were compared in order to confirm the 
correct refolding of P21-His 6 at atomic resolution. Comparison of 
conformational identity between the soluble and refolded prepara- 
tions is an essential analysis needed to perform structural studies 
using the refolded material, and NMR spectroscopy is the only one 
of these methods that addresses that question. 

Results 

The yield of P21-His 6 was greatly improved when the protein was 
recovered from inclusion bodies and refolded. In order to compare 
the yield of soluble to refolded protein, we expressed recombinant 
P21-His 6 in 8 L 2xYT medium. Expression produced 24.9 g of dry 
weight bacteria and 8.1 g of insoluble pellet after lysis and centrifu- 
gation. Most of the protein was present in the insoluble fraction. 
sP21-His 6 was purified from the soluble fraction in a single step of 
Nickel affinity chromatography; however, only 0.68 mg of purified 
protein was obtained. The refolding procedure recovered 57.76 mg 
of rP21-His 6 from inclusion bodies, a yield 85 times greater than 
purification from the soluble fraction. 

Results from invasion and phagocytosis experiments demon- 
strated that soluble and refolded P21-His 6 have similar biological 
properties. In the presence of sP21-His 6 , invasion of Vero cells by 



TCT (Figure 1 A) was increased approximately 3.6-fold, and invasion 
by EA (Figure IB) was increased approximately 2.4-fold. Similar 
results were obtained for rP21-His 6 (4.4-fold increase for TCT and 
2.3-fold increase for EA). The difference between sP21-His 6 and 
rP21-His 6 was not significant. Phagocytosis of zymosan particles 
by macrophages was increased 3.5-fold with the addition of 40 ug/ 
mL sP21-His 6 (Figure 1C) and 3.5-fold with the addition of 40 ug/ 
mL rP21-His 6 22 . The similarity of these increases suggests that 
refolded and soluble protein exhibit the same biological properties. 

Finally, CD and NMR experiments validated the correct refolding 
of P21-His 6 . The CD spectra showed equivalent secondary structure 
composition for both preparations (Figure 2B). Comparison between 
'H NMR spectra of sP21-His 6 and rP21-His 6 showed exactly the 
same chemical shifts and intensities for every proton in the structure 
(Figure 2C). In this way, both samples could be compared directly 
and were found to be identical to one another, with respect to both 
correct folding and conformational ensemble. To better evaluate the 
similarities (or differences) between the spectra of both samples, a 
difference spectrum was calculated and is shown in Figure 2C (green 
line). The larger differences can be assigned to non-protein compo- 
nents of the sample (residual ethanol from concentrator rinsing, 
small differences in DSS concentration, low molecular weight con- 
taminants). These differences underscore the sensitivity of the 
method and the identity of both protein conformations and ensem- 
bles. In addition, up field shifts around 0 ppm in both spectra indi- 
cate a hydrophobic nucleus in P21-His 6 , showing that the structure is 
folded. 

Discussion 

In this study, we applied a one-step refolding procedure to P21-His 6 , 
which increased the yield of the protein 85-fold. This refolding 
method is based on immobilization of the P21-His 6 on Nickel-affin- 
ity resin, preventing protein aggregation during the process. 
Refolding requires approximately 14 hours (which can be performed 
overnight with no need for human interference) and uses less than 
0.5 L of buffer. It is less laborious and quicker than other refolding 
techniques, such as dialysis. 

We used pre-existing biological information about P21 to develop 
protein- specific assays that could be applied to both refolded and 
soluble P21-His 6 fractions. Both samples increased parasite invasion 
and phagocytosis of zymosan particles at similar rates, suggesting 
that they behaved identically with respect to biological activity. This 
information would be sufficient in a majority of biological studies, 
but is not sufficient for structural studies, in which correct folding 
and conformational homogeneity of samples are essential. 

As more structure-focused approaches, CD and NMR spectra 
provided structural comparisons between sP21-His 6 and rP21- 
His 6 . Although CD measurements provide information about pro- 
tein secondary structure, no information at an atomic resolution can 
be provided. This method is more useful for identifying incorrectly 
folded proteins by detecting differences in the spectra, because even if 
two spectra are identical, we cannot confirm that structures are equal. 
Simple one-dimensional 'H NMR overcomes this limitation by pro- 
viding information about the environment of each 'H atom of the 
protein, allowing a comparison of samples obtained by different 
purification methods at atomic resolution. With this simple ID 'H 
NMR measurement, we confirmed that sP21-His 6 and rP21-His 6 
have exactly the same structures and identical conformational equi- 
libria. In addition, more important information can be obtained 
from the spectrum: the good dispersion of the peaks over the ppm 
scale indicates the presence of secondary structure, and the existence 
of high-field shifted peaks around the reference (DSS at 0 ppm) 
indicates the presence of a hydrophobic core in the protein. 
Therefore, the structural and the biological data were in agreement, 
as expected. 
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Figure 1 | (A) sP21-His 6 and rP21-His 6 affect invasion of Vero cells by T. cruzi TCT and EA. Light microscopy of (a) TCT invasion and (B) EA invasion 
after treatment with (1) DMEM (control, not treated), (2) 40 ug/mL of rP21-His 6 and (3) 40 ug/mL of sP21-His 6 . (4) Quantification. Arrows in (1), (2), 
and (3) indicate internalized parasites. (C) Phagocytosis of zymosan particles by mice peritoneal macrophages in the presence of sP21-His 6 . Light 
microscopy of (1) control (untreated) cells and (2) cells treated with 40 |rg/mL sP2 1 -His 6 . (3) Quantification of ( 1 ) and (2). Arrows in (1) and (2) indicate 
internalized particles. 



We propose the use of simple and fast ID 'H NMR spectra to 
validate refolding protocols before increasing protein production. 
This technique is able to detect very small conformational changes 
in protein structure, as 'H NMR is sensitive to changes in the envir- 
onment of each hydrogen atom of a molecule. By searching the 
REFOLD database 28 , we found that 34 of 1001 refolded proteins used 
this NMR spectroscopy in some way. In the majority of cases, the 
studies were driven by NMR, and the refolded protein was produced 
and isotopically labeled for analysis. Only three proteins from the 
database used simple ID 'H NMR spectra, two of them for visual 
inspection 29,30 , and one for comparison with the natural modified 
protein 31 . In a recent study 32 , the authors compared the ID 'H 
NMR spectra from the refolded protein and the protein purified from 
soluble fraction. They concluded that the structures were exactly the 
same because the spectra were identical (no spectral difference was 
presented). In a recent review 33 focused on tools used to evaluate 
protein conformations, NMR is briefly mentioned as one of the 
leading techniques of structural biology. The authors said that 
NMR "still requires very sophisticated equipment, appreciable 
amounts of sample, and specialist human resources that keep them 
away from the pipeline in protein production and control. Newer 
techniques (or new applications of well-established techniques) are 
being developed to complement these traditional biophysical assays 
and to provide information on the success of protein folding." We 
want to stress that the proposed method uses the detailed informa- 
tion present in NMR spectra in a simple way, to answer a simple 
question. If NMR (as x-ray crystallography) is analyzed as a tool to 
determine three-dimensional protein structure, then it presents the 



difficulties pointed out in the aforementioned review; however, this is 
not the case for the ID 'H spectra needed for conformational com- 
parisons of two samples. 

Conclusion 

One-dimensional homonuclear 'H NMR spectroscopy requires no 
special sample production and little time for measurement. 
Typically, a high quality spectrum of a 500 \iL protein sample 
(—100 |iM of a small protein) on a 600 MHz spectrometer can be 
obtained in less than 30 minutes. Most of the time is spent on adjust- 
ment of the equipment and definition of experimental parameters. 
Using spectrometers with a cryoprobe is desirable, but not required. 
Spectra processing and comparison is very quick and can be per- 
formed immediately after acquisition using spectrometer software. 

Finally, simple precautions should be taken for the preparation of 
NMR samples. The NMR buffer should be of neutral to acidic pH and 
low ionic strength. High pH results in less protonated amides, caus- 
ing loss of signal. Low salt concentration (up to 300 mM) is required 
to minimize radio wave reflections during NMR pulses that can cause 
heating of the sample (tuning and matching of the spectrometer 
probe head). 'H NMR is very sensitive to non-water exchangeable 
hydrogens of small molecules, and, as a consequence, the use of 
hydrogen-containing buffer agents (like Tris, Acetate, MES, and 
HEPES) and some other molecules (glycerin, DTT, etc.) should be 
avoided, or these reagents can be replaced by their deuterated equiva- 
lents. Sodium or potassium phosphate buffer is recommended, since 
the hydrogen atoms of these compounds are exchangeable with 
water, resulting in a signal that is coincident with the water resonance 
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Figure 2 | (A) SDS-PAGE of bacterial lysate, showing the predominate expression of P21-His 6 in the insoluble fraction. Lanes: (MW) molecular weight 
markers, (BI) lysate before IPTG induction, (IF) insoluble fraction after induction, and (SF) soluble fraction after induction. Arrow indicates the position 
of P21-His 6 . (B) Circular dichroism (CD) spectra of refolded and soluble P21-His 6 . Superposition of rP21-His 6 (blue) and sP21-His 6 (green) CD spectra 
at 20°C. Refolded and soluble protein samples have similar secondary structure composition. (C) Comparison between 1D ] H NMR spectra of sP21-His 6 
and rP21-His 6 , showing exactly the same structure for both proteins. Spectrum of refolded (blue) and soluble (red) protein; spectra difference after 
normalization (green). The marked peaks indicate (c) low weight contaminants, (d) DSS, and (e) residual ethanol. 



that should be suppressed. Moreover, extensive care with contami- 
nants should be taken, as *H NMR can detect contaminant signals, 
even in small concentrations. 

Methods 

Protein expression and cell lysis. P21-His 6 was expressed in Escherichia coli BL21 
cells transformed with a pET-28a( + ) (Novagen) expression vector, which contains an 
N-terminal His 6 -tag followed by a thrombin recognition site. Expression was carried 
out in 8 L of 2xYT medium (16 g tryptone, 10 g yeast extract, 5 g NaCl per 1 L 
medium) containing 35 ug/mL kanamycin. Bacteria were incubated at 37°C until the 
optical density at 600 nm (OD 600 ) reached —0.7, at which point expression was 
induced by addition of 0.5 mM isopropyl |3-D-l-thiogalactopyranoside (IPTG). 
Cultures were then incubated for an additional 3 hours at 37°C. Cells were harvested 
by centrifugation at 6,000 X g for 40 minutes at 4 C and resuspended in lysis buffer 
(50 mM Tris, 300 mM NaCl, 20 mM imidazole, pH 8.0) containing 1 mg/mL 
lysozyme. The solution was maintained on ice for 30 min before cells were lysed by 
ultrasound (15 cycles of 40 sat 40 W,40 s interval). The lysate solution was subjected 
to centrifugation (14,000 X g for 40 min at 4°C), and both supernatant and pellet 
were used for native and denaturing purification of the protein, respectively. 

Native P21-His 6 purification from the soluble fraction (sP21-His 6 ). The soluble 
fraction was added to 2 mL of Ni-NTA resin (Qiagen) and extensively washed with 
lysis buffer. To remove low-affinity bound proteins, a second wash with wash buffer 
(50 mM Tris, 300 mM NaCl, 50 mM imidazole, pH 8.0) was performed. sP21-His 6 
was eluted with elution buffer I (50 mM Tris, 300 mM NaCl, 160 mM imidazole, pH 
8.0), concentrated, and dialyzed against buffer II (50 mM Tris, 50 mM NaCl, pH 8.0). 

Denaturing P21-His 6 purification and refolding from the insoluble fraction 
(rP21-His 6 ). In order to extract the expressed protein from inclusion bodies, the 
lysate pellet was resuspended in urea buffer (50 mM Tris, 300 mM NaCl, 5 mM 
imidazole, 6 M urea, 5% (v/v) glycerol, pH 8.0) and stirred overnight at room 
temperature. After centrifugation (14,000 X g for 40 min at 4°C), the supernatant, 



containing P21-His 6 extracted from the inclusion bodies, was immobilized in a 
column containing 5 mL Ni-NTA resin fixed on an Akta Purifier system. After 
extensive washing with urea buffer, the urea concentration was slowly decreased with 
a linear buffer gradient ( — 7 mM/min of urea, 0.5 mL/min) 23 . To remove glycerol and 
additional impurities, the column was washed with wash buffer until no protein signal 
was detected. rP21-His 6 was eluted with elution buffer II (50 mM Tris, 300 mM 
NaCl, 220 mM imidazole, pH 8.0), concentrated, and dialyzed against buffer II (see 
above). 

Cell lines and cultures. Vero cells were cultured in Dulbecco's Modified Eagle 
Medium (DMEM) supplemented with 10% fetal bovine serum (FBS, Cultilab), 
10 mg/mL streptomycin (Sigma), 100 U/mL penicillin (Sigma), and 40 mg/mL 
gentamycin (Sigma) at 37°C in a humidified atmosphere containing 5% C0 2 - 
Peritoneal macrophages from BALB/C mice were harvested from the peritoneal 
cavity in 5 mL DMEM. Mice were stimulated with 1 mL 3% thioglycollate medium 3 
days prior to harvest. 

Parasites. To generate tissue -cultured trypomastigotes (TCT) from the CL strain, 
confluent monolayers of Vero cells were infected with metacyclic trypomastigotes 
and maintained for at least two weeks to establish the intracellular cycle. Extracellular 
amastigotes (EA) were derived from trypomastigotes by axenic differentiation. 
Infected Vero cell cultures were subjected to centrifugation (2500 X g, 5 minutes), 
and the pellet was resuspended in liver infusion tryptose (LIT) medium. Finally, the 
resuspension was incubated for 18 hours at 37°C to generate at least 95% pure 
extracellular amastigotes. 

Animals and ethics. Male or female BALB/c mice were six to eight weeks old and 
were maintained under standard conditions on a 12 hour light, 12 hour dark cycle in a 
temperature controlled room (25 ± 2 C) with food and water ad libitum. All 
experimental protocols were approved by the Ethics Committee from Universidade 
Federal de Uberlandia. Moreover, maintenance and care of these animals complied 
with the guidelines of the Laboratory Animal Ethics Committee from the Institution. 
Animal euthanasia was performed in accordance with international welfare grounds, 
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according to the American Veterinary Medical Association Guidelines on 
Euthanasia. 

Host cell invasion assay. A volume of 500 uL Vero cell suspension (1.5 X 10 5 cells) 
was added to multiwell plates containing sterile glass coverslips and incubated 
overnight. The invasion assay was performed by adding EA (20 parasites per cell) or 
TCT (10 parasites per cell) to plates in the presence or absence of 40 ug/mLsP21-His 6 
or 40 (ig/mL rP21-His 6 . The plates were incubated for 2 hours at 37°C in a humidified 
incubator containing 5% C0 2 . After incubation, the cells were gently washed three 
times with PBS, fixed with Bouin's solution, and stained with Giemsa. 

Phagocytosis assay. Zymosan particles (10 mg/mL, Sigma) were resuspended in 
sterile PBS, sonicated for 15 minutes, and pelleted by centrifugation. To determine 
particle concentration, a Neubauer chamber was used for counting. Resident 
peritoneal macrophages were seeded in cell culture well plates (5 X 10 5 cells/well) 
containing 13 mm round glass coverslips and were then incubated overnight in a 
humidified atmosphere containing 5% C0 2 at 37°C to allow the cells to adhere. Non- 
adherent cells were removed by washing with PBS before adding DMEM containing 
10% FBS. To perform the phagocytosis assay, zymosan particles were added to wells 
(20 particles/each macrophage fixed) in the presence or absence of 40 ug/mL sP21- 
His 6 . Particles were incubated with macrophages for 2 hours. The cells were then 
washed with PBS, fixed in Bouin's solution, and stained with Giemsa. Results were 
expressed as the number of internalized particles/100 cells. These results were 
compared to those previously determined for rP21-His 6 using the same protocol 22 . 

Circular dichroism spectropolarimetry (CD). Spectra were recorded on a Jasco 815 
CD spectrometer using 0.1 cm thick quartz cells. Samples containing 0.15 mg/mL 
sP21-His 6 or rP21-His 6 in 10 mM sodium phosphate buffer (pH 7.4) were analyzed at 
20°C to assess structural similarity. 

Nuclear magnetic resonance spectroscopy (NMR). Samples of sP21-His 6 and rP21- 
His 6 were concentrated to the same concentration (1.3 mg/mL, 70 uM in a final 
volume of 500 uL) and then simultaneously dialyzed against 2 L NMR buffer 
(50 mM potassium phosphate, 150 mM KC1, pH 6.0) at 4°C. The NMR buffer was 
changed twice to ensure that the samples were in exactly the same buffer conditions 
for NMR measurements, as l H signals are very sensitive to changes in their 
environment. DSS (100 uM, for spectrum referencing) and D 2 0 (5% v/v), for tuning 
and matching the spectrometer probe head) were added to both samples. ID 1 H 
spectra were recorded at 303 K in a high-resolution, small bore 600 MHz Bruker 
Avance III spectrometer. Since the spectrometer was equipped with a TCI cryoprobe, 
a 4X enhancement in the signal intensity was achieved. The intense water signal 
around 4.8 ppm, typical for an aqueous protein sample, was suppressed using a 
standard Watergate W5 pulse sequence. To ensure spectra with a good signal to noise 
ratio were obtained for the diluted samples, each ID experiment was acquired with a 
total of 256 scans, resulting in an experimental time of approximately 17 minutes per 
spectrum. Proton chemical shifts were referenced (0 ppm) to the l H resonance 
frequency of the sharp and intense signal of the methyl groups in DSS. Time-domain 
data were acquired and processed with an exponential line broadening of 1 Hz using 
the Bruker BioSpin spectrometer control software TopSpin 3.1. The same software 
was used for the spectra analysis and for obtaining the difference spectrum. 
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